RAW SEQUENCE LISTING DATE: 10/22/2004

PATENT APPLICATION: US/10/787,267A TIME: 12:50:01

Input Set : A:\GC687-3-Dl-seqlist-corr.txt 
Output Set: N:\CRF4\10222004\J787267A.raw 



4 <110> APPLICANT: Dartois, Veronique A.

5 Hoch, James A. 

6 Valle, Fernando 

7 Kumar, Manoj 

9 <120> TITLE OF INVENTION: 2,5-DKG Permeases

12 <130> FILE REFERENCE: GC687-3-D1

14 <140> CURRENT APPLICATION NUMBER: US 10/787,267A

15 <141> CURRENT FILING DATE: 2004-02-25 

17 <150> PRIOR APPLICATION NUMBER: US 09/922,501 

18 <151> PRIOR FILING DATE: 2001-08-03 

20 <150> PRIOR APPLICATION NUMBER: US 60/325,774 

21 <151> PRIOR FILING DATE: 2000-08-04 

23 <150> PRIOR APPLICATION NUMBER: US 60/421,141 

24 <151> PRIOR FILING DATE: 2000-09-29 
26 <160> NUMBER OF SEQ ID NOS: 22 

28 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

30 <210> SEQ ID NO: 1 

31 <211> LENGTH: 1500 

32 <212> TYPE: DNA

33 <213> ORGANISM: Unknown

35 <220> FEATURE:

36 <223> OTHER INFORMATION: environmental source

39 <220> FEATURE:

40 <221> NAME/KEY: CDS

41 <222> LOCATION: (94)...(1374)

43 <400> SEQUENCE: 1 



44  ggcgaatagc ccggccggcg tcataataac ggccttctct gtaccctaca tacggcggcg  60
45  gcgtcatgaa cctcaacttt agtaggcaag cct atg aac agc tct acc aat gca  114
46                                       Met Asn Ser Ser Thr Asn Ala
47                                       1               5

49  acg aaa cgc tgg tgg tac atc atg cct atc gtg ttt atc acg tat agc  162
50  Thr Lys Arg Trp Trp Tyr Ile Met Pro Ile Val Phe Ile Thr Tyr Ser
51          10                  15                  20

53  ctg gcg tat ctc gac cgc gca aac ttc agc ttt gct tcg gca gcg ggc  210
54  Leu Ala Tyr Leu Asp Arg Ala Asn Phe Ser Phe Ala Ser Ala Ala Gly
55      25                  30                  35

57  att acg gaa gat tta ggc att acc aaa ggc atc tcg tcg ctt ctt ggc  258
58  Ile Thr Glu Asp Leu Gly Ile Thr Lys Gly Ile Ser Ser Leu Leu Gly
59  40                  45                  50                  55

61  gca ctt ttc ttc ctc ggc tat ttc ttc ttc cag atc ccg ggg gcg att  306
62  Ala Leu Phe Phe Leu Gly Tyr Phe Phe Phe Gln Ile Pro Gly Ala Ile
63                  60                  65                  70

65  tac gcg gaa cgc cgt agc gta cgg aag ctg att ttc atc tgt ctg atc  354

66  Tyr Ala Glu Arg Arg Ser Val Arg Lys Leu Ile Phe Ile Cys Leu Ile
67              75                  80                  85

69  ctg tgg ggc gcc tgc gcc tcg ctt gac cgg gat ggt gca caa tat tcc  402
70  Leu Trp Gly Ala Cys Ala Ser Leu Asp Arg Asp Gly Ala Gln Tyr Ser
71          90                  95                  100

73  agc gct ggc tgg cga tcc gtt tta ttc tcg gct gtc gta gaa gcg gcg  450
74  Ser Ala Gly Trp Arg Ser Val Leu Phe Ser Ala Val Val Glu Ala Ala
75      105                 110                 115

77  gtc atg ccg gcg atg ctg att tac atc agt aac tgg ttt acc aaa tca  498
78  Val Met Pro Ala Met Leu Ile Tyr Ile Ser Asn Trp Phe Thr Lys Ser
79  120                 125                 130                 135

81  gaa cgt tca cgc gcc aac acc ttc tta atc ctc ggc aac ccg gtc acg  546
82  Glu Arg Ser Arg Ala Asn Thr Phe Leu Ile Leu Gly Asn Pro Val Thr
83                  140                 145                 150

85  gta ctg tgg atg tcg gtg gtc tcc ggc tac ctg att cag tcc ttc ggc  594
86  Val Leu Trp Met Ser Val Val Ser Gly Tyr Leu Ile Gln Ser Phe Gly
87              155                 160                 165

89  tgg cgt gaa atg ttt att att gaa ggc gtt ccg gcc gtc ctc tgg gcc  642
90  Trp Arg Glu Met Phe Ile Ile Glu Gly Val Pro Ala Val Leu Trp Ala
91          170                 175                 180

93  ttc tgc tgg tgg gtg ctg gtc aaa gtt aaa ccg tcg cag gtg aac tgg  690
94  Phe Cys Trp Trp Val Leu Val Lys Val Lys Pro Ser Gln Val Asn Trp
95      185                 190                 195

97  ttg tcg gaa aac gag aaa gcc gcg ctg cag gcg cag ctg gag agc gag  738
100 Leu Ser Glu Asn Glu Lys Ala Ala Leu Gln Ala Gln Leu Glu Ser Glu
101 200                 205                 210                 215

103 cag cag ggt att aaa gcc gtg cgt aac tac ggc gaa gcc ttc cgc tca  786
104 Gln Gln Gly Ile Lys Ala Val Arg Asn Tyr Gly Glu Ala Phe Arg Ser
105                 220                 225                 230

107 cgc aac gtc att cta ctg tgc atg cag tat ttt gcc tgg agt atc ggc  834
108 Arg Asn Val Ile Leu Leu Cys Met Gln Tyr Phe Ala Trp Ser Ile Gly
109             235                 240                 245

111 gtg tac ggt ttt gtg ctg tgg ttg ccg tca att att cgc agc ggc ggc  882
112 Val Tyr Gly Phe Val Leu Trp Leu Pro Ser Ile Ile Arg Ser Gly Gly
113         250                 255                 260

115 gtc aat atg ggg atg gtg gaa gtc ggc tgg ctc tct tcg gtg cct tat  930
116 Val Asn Met Gly Met Val Glu Val Gly Trp Leu Ser Ser Val Pro Tyr
117     265                 270                 275

119 ctg gcc gcg act att gcg atg atc gtc gtc tcc tgg gct tcc gat aaa  978
120 Leu Ala Ala Thr Ile Ala Met Ile Val Val Ser Trp Ala Ser Asp Lys
121 280                 285                 290                 295

123 atg cag aac cgt aaa ctg ttc gtc tgg ccg ctg ctg ctg att ggc gga  1026
124 Met Gln Asn Arg Lys Leu Phe Val Trp Pro Leu Leu Leu Ile Gly Gly
125                 300                 305                 310

127 ctg gct ttt att ggc tca tgg gcc gtc ggc gct aac cat ttc tgg gcc  1074
128 Leu Ala Phe Ile Gly Ser Trp Ala Val Gly Ala Asn His Phe Trp Ala
129             315                 320                 325

131 tct tat acc ctg ctg gtg att gcc aat gcg gca atg tac gcc cct tac  1122
132 Ser Tyr Thr Leu Leu Val Ile Ala Asn Ala Ala Met Tyr Ala Pro Tyr
133         330                 335                 340

135 ggt ccg ttt ttc gcc atc att ccg gaa atg ctg ccg cgt aac gtc gcc  1170
136 Gly Pro Phe Phe Ala Ile Ile Pro Glu Met Leu Pro Arg Asn Val Ala
137     345                 350                 355

139 ggt ggc gca atg gcg ctc atc aac agc atg ggg gcc tta ggt tca ttc  1218
140 Gly Gly Ala Met Ala Leu Ile Asn Ser Met Gly Ala Leu Gly Ser Phe
141 360                 365                 370                 375

143 ttt ggt tcg tgg ttc gtg ggc tac ctg aac ggc acc acc ggc agt cca  1266
144 Phe Gly Ser Trp Phe Val Gly Tyr Leu Asn Gly Thr Thr Gly Ser Pro
145                 380                 385                 390

147 tca gcc tca tac att ttc atg gga gtg gcg ctt ttc gcc tcg gta tgg  1314
148 Ser Ala Ser Tyr Ile Phe Met Gly Val Ala Leu Phe Ala Ser Val Trp
149             395                 400                 405

151 ctt act tta att gtt aag cct gct aac aat caa aag ctc ccc atc ggc  1362
152 Leu Thr Leu Ile Val Lys Pro Ala Asn Asn Gln Lys Leu Pro Ile Gly
153         410                 415                 420

155 gct cgt cac gcc tgacctttac tacttacgga gatcacgcct tgggtacgtt  1414
156 Ala Arg His Ala
157     425

159 gcaggacaaa ccgataggca ccgcaaaggc tggggccatc gagcagcgcg taaacagtca  1474
160 gctggttgct gtcgctgtgc ggcgtc  1500

162 <210> SEQ ID NO: 2

163 <211> LENGTH: 427

164 <212> TYPE: PRT

165 <213> ORGANISM: Unknown

167 <220> FEATURE:

168 <223> OTHER INFORMATION: environmental source

170 <400> SEQUENCE: 2

171 Met Asn Ser Ser Thr Asn Ala Thr Lys Arg Trp Trp Tyr Ile Met Pro
172 1               5                   10                  15

173 Ile Val Phe Ile Thr Tyr Ser Leu Ala Tyr Leu Asp Arg Ala Asn Phe
174             20                  25                  30

175 Ser Phe Ala Ser Ala Ala Gly Ile Thr Glu Asp Leu Gly Ile Thr Lys
176         35                  40                  45

177 Gly Ile Ser Ser Leu Leu Gly Ala Leu Phe Phe Leu Gly Tyr Phe Phe
178     50                  55                  60

179 Phe Gln Ile Pro Gly Ala Ile Tyr Ala Glu Arg Arg Ser Val Arg Lys
180 65                  70                  75                  80

181 Leu Ile Phe Ile Cys Leu Ile Leu Trp Gly Ala Cys Ala Ser Leu Asp
182                 85                  90                  95

183 Arg Asp Gly Ala Gln Tyr Ser Ser Ala Gly Trp Arg Ser Val Leu Phe
184             100                 105                 110

185 Ser Ala Val Val Glu Ala Ala Val Met Pro Ala Met Leu Ile Tyr Ile
186         115                 120                 125

187 Ser Asn Trp Phe Thr Lys Ser Glu Arg Ser Arg Ala Asn Thr Phe Leu
188     130                 135                 140

189 Ile Leu Gly Asn Pro Val Thr Val Leu Trp Met Ser Val Val Ser Gly
190 145                 150                 155                 160

191 Tyr Leu Ile Gln Ser Phe Gly Trp Arg Glu Met Phe Ile Ile Glu Gly

192                 165                 170                 175

193 Val Pro Ala Val Leu Trp Ala Phe Cys Trp Trp Val Leu Val Lys Val
194             180                 185                 190

195 Lys Pro Ser Gln Val Asn Trp Leu Ser Glu Asn Glu Lys Ala Ala Leu
196         195                 200                 205

197 Gln Ala Gln Leu Glu Ser Glu Gln Gln Gly Ile Lys Ala Val Arg Asn
198     210                 215                 220

199 Tyr Gly Glu Ala Phe Arg Ser Arg Asn Val Ile Leu Leu Cys Met Gln
200 225                 230                 235                 240

201 Tyr Phe Ala Trp Ser Ile Gly Val Tyr Gly Phe Val Leu Trp Leu Pro
202                 245                 250                 255

203 Ser Ile Ile Arg Ser Gly Gly Val Asn Met Gly Met Val Glu Val Gly
204             260                 265                 270

205 Trp Leu Ser Ser Val Pro Tyr Leu Ala Ala Thr Ile Ala Met Ile Val
206         275                 280                 285

207 Val Ser Trp Ala Ser Asp Lys Met Gln Asn Arg Lys Leu Phe Val Trp
208     290                 295                 300

209 Pro Leu Leu Leu Ile Gly Gly Leu Ala Phe Ile Gly Ser Trp Ala Val
210 305                 310                 315                 320

211 Gly Ala Asn His Phe Trp Ala Ser Tyr Thr Leu Leu Val Ile Ala Asn
212                 325                 330                 335

213 Ala Ala Met Tyr Ala Pro Tyr Gly Pro Phe Phe Ala Ile Ile Pro Glu
214             340                 345                 350

215 Met Leu Pro Arg Asn Val Ala Gly Gly Ala Met Ala Leu Ile Asn Ser
216         355                 360                 365

217 Met Gly Ala Leu Gly Ser Phe Phe Gly Ser Trp Phe Val Gly Tyr Leu
218     370                 375                 380

219 Asn Gly Thr Thr Gly Ser Pro Ser Ala Ser Tyr Ile Phe Met Gly Val
220 385                 390                 395                 400

221 Ala Leu Phe Ala Ser Val Trp Leu Thr Leu Ile Val Lys Pro Ala Asn
222                 405                 410                 415

223 Asn Gln Lys Leu Pro Ile Gly Ala Arg His Ala
224             420                 425



226 <210> SEQ ID NO: 3

227 <211> LENGTH: 1775

228 <212> TYPE: DNA

229 <213> ORGANISM: Unknown

231 <220> FEATURE:

232 <223> OTHER INFORMATION: environmental source

234 <220> FEATURE:

235 <221> NAME/KEY: CDS

236 <222> LOCATION: (214)...(1491)

238 <400> SEQUENCE: 3 

239 ggcaatttgc ggtgtttttt ccgcaggacg ttcatcgtcc ggcctgtatt catcaacggc  60
240 cctgcgctat tcgcaaagtg gtggtgaaaa taccgctgcg ttatttaacg cccaataagc  120
241 aacaccgagt ttataaccct gaacgacacg gctgcgggcc tgtgtagacg cccctacgcc  180
242 ttaacaccac taaatgactc tacaggtgta tat atg aat aca gcc tct gtt tct  234
243                                      Met Asn Thr Ala Ser Val Ser
244                                      1               5
246 gtc acc caa agc cag gcg atc ccc aaa tta cgc tgg ttg aga ata gtg  282
247 Val Thr Gln Ser Gln Ala Ile Pro Lys Leu Arg Trp Leu Arg Ile Val
248         10                  15                  20

250 ccg cct att ctt att acc tgc att att tcc tat atg gac cgg gtg aac  330
251 Pro Pro Ile Leu Ile Thr Cys Ile Ile Ser Tyr Met Asp Arg Val Asn
252     25                  30                  35

254 atc gcc ttc gcc atg ccc ggc ggc atg gac gat gaa ctg ggc atc acc  378
255 Ile Ala Phe Ala Met Pro Gly Gly Met Asp Asp Glu Leu Gly Ile Thr
256 40                  45                  50                  55

258 gcc tcg atg gcc ggg ttg gcc ggc ggt att ttc ttt atc ggt tat ctg  426
259 Ala Ser Met Ala Gly Leu Ala Gly Gly Ile Phe Phe Ile Gly Tyr Leu
260                 60                  65                  70

262 ttc ttg cag gta ccc ggc ggc aag ctg gcg gtg tac ggc aac ggc aag  474
263 Phe Leu Gln Val Pro Gly Gly Lys Leu Ala Val Tyr Gly Asn Gly Lys
264             75                  80                  85

266 aaa ttc atc ggt tgg tcg ttg ttg gcc tgg gcg gtg att tcc gtg ctg  522
267 Lys Phe Ile Gly Trp Ser Leu Leu Ala Trp Ala Val Ile Ser Val Leu
268         90                  95                  100

270 acc ggg ctg gtc acg aat cag tat caa ttg ctg ttc ctg cgc ttc gcc  570
271 Thr Gly Leu Val Thr Asn Gln Tyr Gln Leu Leu Phe Leu Arg Phe Ala
272     105                 110                 115

274 ctc ggc cgt ttc cga agc ggc atg ctg cgg tgg gtg ctg acc atg atc  618
275 Leu Gly Arg Phe Arg Ser Gly Met Leu Arg Trp Val Leu Thr Met Ile
276 120                 125                 130                 135

278 agc aac tgg ttc ccg gac aag gaa cgc ggg cgc gcc aac gcc atc gtc  666
279 Ser Asn Trp Phe Pro Asp Lys Glu Arg Gly Arg Ala Asn Ala Ile Val
280                 140                 145                 150

282 atc atg ttc gtg ccg atc gcc ggc atc ctt acc gca ccg ctg tcc ggc  714
283 Ile Met Phe Val Pro Ile Ala Gly Ile Leu Thr Ala Pro Leu Ser Gly
284             155                 160                 165

286 tgg atc atc acc gcc tgg gac tgg cgc atg ctg ttc ctg gtc gag ggc  762
287 Trp Ile Ile Thr Ala Trp Asp Trp Arg Met Leu Phe Leu Val Glu Gly
288         170                 175                 180

290 gcg ctg tcg ctg gtc gtg atg gtg ctg tgg tat ttc acc atc agc aac  810
291 Ala Leu Ser Leu Val Val Met Val Leu Trp Tyr Phe Thr Ile Ser Asn
292     185                 190                 195

294 cgt cca caa gag gcc aaa agg att tcg cag gcg gaa aaa gat tat ctg  858
295 Arg Pro Gln Glu Ala Lys Arg Ile Ser Gln Ala Glu Lys Asp Tyr Leu
296 200                 205                 210                 215

298 atc aaa acg ctg cac gac gaa cag ttg ctg atc aaa ggc aaa acg gtg  906
299 Ile Lys Thr Leu His Asp Glu Gln Leu Leu Ile Lys Gly Lys Thr Val
300                 220                 225                 230

302 cgc aac gcc tcg ctg cgt cgg gtg ctg ggc gac aaa atc atg tgg aag  954
303 Arg Asn Ala Ser Leu Arg Arg Val Leu Gly Asp Lys Ile Met Trp Lys
304             235                 240                 245

306 ttc ttc tac cag acc ggg ata tac ggc tac acc ctg tgg ctg ccg acc  1002
307 Phe Phe Tyr Gln Thr Gly Ile Tyr Gly Tyr Thr Leu Trp Leu Pro Thr
308         250                 255                 260

310 att ctc aag ggg ctc acc aac ggc aat atg gag cag gtc ggg atg ctg  1050